Ha, Eun Young. Modeling Discourse Structure and Temporal Event Relations for Automated Document Summarization with Markov Logic Networks. (under the Direction of Modeling Discourse Structure and Temporal Event Relations for Automated Document Summarization with Markov Logic Networks
نویسندگان
چکیده
HA, EUN YOUNG. Modeling Discourse Structure and Temporal Event Relations for Automated Document Summarization with Markov Logic Networks. (Under the direction of James C. Lester.) Recent years have seen significant progress in natural language processing. A key challenge posed by many natural language applications ranging from text summarization to question answering and machine translation is designing an effective discourse processing framework. Linguistic phenomena that hold across sentence boundaries are still not well understood, even though they provide crucial knowledge for integrating the information communicated by individual sentences. Our work addresses automatic processing of natural language spanning multiple sentences. More specifically, it concerns modeling discourse structure and temporal event relations in support of automated document summarization. The models for discourse structure and temporal event relations are learned from natural language corpora in a supervised manner by utilizing the Markov logic machine learning framework. Markov logic is a probabilistic extension of first-order logic that allows formulae to be violated. In contrast to first-order logic in which a possible world has a binary value (true or false), Markov logic assigns a weight to each formula, reflecting the strength of the constraint the formula represents. A Markov logic network is a set of weighted firstorder clauses that represent objects in the domain, together with constants. An implemented prototype of a document summarizer utilizes a discourse structure model and a temporal event relation model in order to identify salient content from a given document and to structure the identified salient content into a coherent summary. The discourse structure model is learned from the RST-DT corpus, which consists of a collection of newspaper articles that are manually annotated with rhetorical structure based on Rhetorical Structure Theory. The temporal event relation model is learned from the TempEval-2 corpus, which contains a set of newpaper articles that are manually annotated with events and time expressions following the TimeML specifications for events and temporal relations. Evaluations suggest that the Markov logic network can effectively model discourse structure and temporal event relations. Modeling Discourse Structure and Temporal Event Relations for Automated Document Summarization with Markov Logic Networks.
منابع مشابه
Exploring the Effectiveness of Lexical Ontologies for Modeling Temporal Relations with Markov Logic
Temporal analysis of events is a central problem in computational models of discourse. However, correctly recognizing temporal aspects of events poses serious challenges. This paper introduces a joint modeling framework and feature set for temporal analysis of events that utilizes Markov Logic. The feature set includes novel features derived from lexical ontologies. An evaluation suggests that ...
متن کاملEvent and Temporal Expression Extraction from Raw Text: First Step towards a Temporally Aware System
Extracting temporal information from raw text is fundamental for deep language understanding, and key to many applications like question answering, information extraction, and document summarization. Our long-term goal is to build complete temporal structure of documents and use the temporal structure in other applications like textual entailment, question answering, visualization, or others. I...
متن کاملSystematic literature review of fuzzy logic based text summarization
Information Overloadrq is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting userslq informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...
متن کاملNCSU: Modeling Temporal Relations with Markov Logic and Lexical Ontology
As a participant in TempEval-2, we address the temporal relations task consisting of four related subtasks. We take a supervised machine-learning technique using Markov Logic in combination with rich lexical relations beyond basic and syntactic features. One of our two submitted systems achieved the highest score for the Task F (66% precision), untied, and the second highest score (63% precisio...
متن کاملJointly Identifying Temporal Relations with Markov Logic
Recent work on temporal relation identification has focused on three types of relations between events: temporal relations between an event and a time expression, between a pair of events and between an event and the document creation time. These types of relations have mostly been identified in isolation by event pairwise comparison. However, this approach neglects logical constraints between ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010